I report on the statistical analysis of a sample of about 100 AGN with simultaneous UV and X-ray
observations with Swift. I found clear correlations of the X-ray spectral slope $\alpha_{\rm X}$, the
UV slope $\alpha_{\rm UV}$, and the optical-to-X-ray spectral slope $\alpha_{\rm ox}$ with the
Eddington ratio $L/L_{\rm Edd}$. I also report on the bolometric corrections for
$L_{0.2-2.0\,{\rm keV}}$ and $L_{5100}$. A major aspect of the statistical analysis is the use of
multi-variate tools such as Principal Component Analysis (PCA) and cluster analysis. This analysis
shows that the main driver of the AGN properties in this sample is the Eddington ratio
$L/L_{\rm Edd}$. Although separating Seyfert 1s into NLS1s and BLS1s is a useful classification, the
2000 km s$^{-1}$ cutoff line is still arbitrary. A better classification scheme may be to separate
AGN into low and high $L/L_{\rm Edd}$ AGN, as suggested by the cluster analysis.

1. Introduction

Following the milestone paper on the class, Narrow-Line Seyfert 1 galaxies (NLS1s) have become an
important part of AGN research. Their extreme properties place them at one end of the AGN
population, along what is often known as the eigenvector 1 relation in AGN (e.g. [Q, ^ [14|, [18|]).
This eigenvector 1 is typically associated with the Eddington ratio $L/L_{\rm Edd}$, which seems to
drive the properties of AGN. $L/L_{\rm Edd}$ is therefore a crucial parameter for describing AGN.
However, $L/L_{\rm Edd}$ can be determined directly for only a small fraction of AGN. For the
majority of AGN we rely on secondary methods like a bolometric correction (e.g. |2]|), which
calculates the bolometric luminosity from the luminosity at a specific wavelength.
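
As a minimal sketch of how an Eddington ratio follows once a bolometric luminosity and a black hole mass are available, the snippet below assumes the standard Eddington luminosity for ionized hydrogen, $L_{\rm Edd} \approx 1.26 \times 10^{38}\,(M_{\rm BH}/M_\odot)$ erg s$^{-1}$. The function names and the example numbers are illustrative and are not taken from the sample.

```python
# Eddington luminosity per solar mass for ionized hydrogen, in erg/s
# (assumed standard value; not quoted in the text).
L_EDD_PER_MSUN = 1.26e38


def eddington_luminosity(m_bh_msun):
    """Eddington luminosity in erg/s for a black hole mass in solar masses."""
    if m_bh_msun <= 0:
        raise ValueError("black hole mass must be positive")
    return L_EDD_PER_MSUN * m_bh_msun


def eddington_ratio(l_bol, m_bh_msun):
    """L/L_Edd for a bolometric luminosity in erg/s and a mass in solar masses."""
    return l_bol / eddington_luminosity(m_bh_msun)


# Hypothetical object: L_bol = 1.26e45 erg/s and M_BH = 1e7 solar masses.
ratio = eddington_ratio(1.26e45, 1.0e7)
print(f"L/L_Edd = {ratio:.2f}")
```

Both inputs carry their own uncertainties, which is why the text relies on secondary estimators for most objects.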

In these proceedings I discuss the bolometric correction based on simultaneous UV and X-ray
observations obtained by Swift. This study was performed on a sample of 92 AGN [23], and I will
also show how spectral slopes can be used as estimators of $L/L_{\rm Edd}$. When [25] and [11]
described NLS1s, they defined the class purely phenomenologically. After more than 25 years, is the
definition of NLS1s based solely on FWHM(H$\beta$) < 2000 km s$^{-1}$ still valid, or do we have to
extend this definition? Here multi-variate analysis techniques come into play. With these
techniques we can put the definition on a more solid statistical basis.

2. NLS1s in the Literature

When I gave a talk on NLS1 statistics at the 1999 Bad Honnef NLS1 meeting, I started out by
showing how NLS1s had become more and more popular over the previous decade and how the number
of publications with "NLS1" or "Narrow-Line Seyfert 1" in their title had grown exponentially [15].
Are NLS1s still relevant in the literature today (March 2011) and, more importantly, are NLS1s
still an active research field? Figure 1 displays the number of papers per year that have "NLS1" or
"Narrow-Line Seyfert 1" in their title or abstract. The red triangles display the refereed
publications and the blue circles show all publications. The number of NLS1 publications peaked
around 2001/2002, but is still at a level of roughly 100 publications per year, including 50-60
refereed articles. NLS1s therefore remain a very active field in the AGN community.

Further support for this statement is that the 1999 Bad Honnef NLS1 meeting had 60 participants.
Of these, only 6 are at this 2011 meeting in Milan, which has more than 80 participants. In other
words, NLS1s have attracted a lot of new people to the field. As shown in the articles in these
proceedings, the field of NLS1s has developed dramatically over the past ten years, and in 2011 we
have a much better understanding of the place of NLS1s in the Universe than we had in 1999.



3. Sample Selection 



Our AGN sample is selected from the ROSAT bright soft X-ray AGN sample by [|1^, |17|], which
contains a total of 110 AGN. This particular AGN sample has several advantages. First, all AGN in
the sample are bright not only in X-rays but also in the optical/UV, which allows us to obtain
high-quality data in a relatively short amount of observing time. Second, all of the AGN are
intrinsically unabsorbed in X-rays and do not show high reddening in the UV. Third, by definition,
all of these AGN have at least one ROSAT observation, which allows studies of their long-term
variability. We also obtained optical spectroscopy data for all AGN [|l^], which enable us to
perform multi-variate statistical analysis of emission line and continuum properties.

Figure 1: Papers per year which have the name "NLS1" or "Narrow-Line Seyfert 1" in their title or
abstract. The red triangles display the refereed publications and the blue circles show all
publications.

Of the 110 AGN, 92 had been observed by Swift by January 2010, and the results of these
observations were published in [|^]. By mid-2011 almost the entire sample had been observed.
Half of the AGN in the sample are NLS1s. Almost all of the AGN have been observed with Swift's
UV/Optical Telescope (UVOT; [27]) in all 6 filters, which allows us to determine the optical/UV
slope $\alpha_{\rm UV}$. The majority of the AGN have been visited by Swift multiple times, which
gives us a handle on the variability of these sources. Many of these AGN show very strong
variability in X-rays as well as in the UV [23], which makes it necessary to observe the
optical/UV and X-ray bands simultaneously when studying the spectral energy distributions of AGN.
The data analysis of the Swift observations of the AGN in the sample is described in [23].
4. Swift

The Swift Gamma-Ray Burst observatory was launched in November 2004 [|T0[]. While its prime
objective was to detect and follow up Gamma-Ray Bursts, Swift has turned into a multi-purpose
observatory thanks to its fast slewing and response capability and its multi-wavelength coverage.
As shown in Figure 2, Swift is equipped with three telescopes: the Burst Alert Telescope (BAT;
[|l|]), which covers the 15-150 keV range, the X-ray Telescope (XRT; |^]), covering the 0.3-10 keV
energy band, and the UV/Optical Telescope (UVOT; [27]), covering the 1800-6000 Å wavelength
range. The simultaneous X-ray and UV coverage makes Swift an ideal observatory for AGN.





Figure 2: The Swift Gamma-Ray Burst Explorer Mission is equipped with the BAT, XRT and UVOT. 
(Credit: NASA/GSFC) 



5. Simple correlation analysis 



The full simple statistical analysis of the Swift AGN sample can be found in [23]. In these
proceedings we show only a few of the correlations found in the sample.

As mentioned in the introduction, one way to determine the bolometric luminosity and therefore
$L/L_{\rm Edd}$ is through bolometric corrections. From our sample we found the following
bolometric corrections for the luminosity at 5100 Å and in the rest-frame 0.2-2.0 keV band:

$$\log L_{\rm bol} = (1.32 \pm 0.06) \times \log L_{5100} - (10.84 \pm 2.21) \qquad (5.1)$$

$$\log L_{\rm bol} = (1.23 \pm 0.06) \times \log L_{\rm X} - (7.36 \pm 2.01) \qquad (5.2)$$

These relations are displayed in Figure 17 in [23].
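
The two relations can be applied directly, as in the hedged sketch below. It uses only the best-fit coefficients of Eqs. 5.1 and 5.2 and ignores their quoted uncertainties. The luminosity units are not stated in these proceedings, and because the slopes differ from unity the offsets depend on them, so the input must be in the units adopted in [23]. The dictionary keys, the function name and the input value are illustrative.

```python
# Slope and offset of log L_bol = slope * log L - offset (Eqs. 5.1 and 5.2).
# Only the best-fit values are used; the quoted uncertainties are ignored.
BOLOMETRIC_FITS = {
    "L5100": (1.32, 10.84),  # luminosity at 5100 Angstrom
    "LX": (1.23, 7.36),      # rest-frame 0.2-2.0 keV luminosity
}


def log_l_bol(log_l, band):
    """Return log L_bol for a log luminosity measured in the chosen band."""
    slope, offset = BOLOMETRIC_FITS[band]
    return slope * log_l - offset


# Illustrative input only; the units must match those adopted in [23].
for band in BOLOMETRIC_FITS:
    print(band, round(log_l_bol(37.0, band), 2))
```

Since both slopes are greater than one, the implied ratio of bolometric to band luminosity grows with luminosity instead of being a constant factor.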

In particular we were interested in relations between $L/L_{\rm Edd}$ and spectral slopes.
Figure 3 displays the relation between the X-ray spectral slope $\alpha_{\rm X}$ and
$L/L_{\rm Edd}$. This is a strong correlation which is actually dominated by the Broad-Line
Seyfert 1s. NLS1s with $L/L_{\rm Edd}$ around unity do not show a dependence of the X-ray spectral
slope on $L/L_{\rm Edd}$. The relation between $L/L_{\rm Edd}$ and $\alpha_{\rm X}$ found in the
Swift AGN sample confirms earlier results by [|7| using ROSAT results and p.% using the 2-10 keV
spectral slopes that high-$L/L_{\rm Edd}$ AGN have steeper X-ray continua than
low-$L/L_{\rm Edd}$ AGN.

Figure 4 shows the anti-correlation between $L/L_{\rm Edd}$ and the optical/UV spectral slope
$\alpha_{\rm UV}$. AGN with higher $L/L_{\rm Edd}$ display bluer optical/UV spectra. Note that
there is an offset between NLS1s and BLS1s, with NLS1s showing a higher Eddington ratio for a
given UV spectral slope than BLS1s [23]. We also noticed a strong correlation between
$L/L_{\rm Edd}$ and the optical-to-X-ray spectral slope $\alpha_{\rm ox}$¹, as shown in Figure 5.
AGN with higher $L/L_{\rm Edd}$ appear to be X-ray weaker [23].
Figure 3: Correlation between the Eddington ratio $L/L_{\rm Edd}$ and the X-ray spectral slope
$\alpha_{\rm X}$. NLS1s are displayed as blue triangles and BLS1s as red squares.



¹The X-ray loudness is defined as $\alpha_{\rm ox} = -0.384 \log(f_{2\,{\rm keV}}/f_{2500\,\text{Å}})$.
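
The footnote definition translates into a one-line function. This is a sketch that assumes a base-10 logarithm and that both flux densities are given in the same units, so that only their ratio matters; the function name and the example ratio are illustrative.

```python
import math


def alpha_ox(f_2kev, f_2500):
    """Optical-to-X-ray slope from flux densities at 2 keV and 2500 Angstrom.

    Both flux densities must be positive and in the same units.
    """
    if f_2kev <= 0 or f_2500 <= 0:
        raise ValueError("flux densities must be positive")
    return -0.384 * math.log10(f_2kev / f_2500)


# Hypothetical source whose 2 keV flux density is 1e-4 of that at 2500 Angstrom.
print(round(alpha_ox(1.0e-4, 1.0), 3))
```

With this sign convention a larger $\alpha_{\rm ox}$ means a source that is weaker in X-rays relative to the UV.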
Figure 4: Correlation between the Eddington ratio $L/L_{\rm Edd}$ and the optical/UV spectral
slope $\alpha_{\rm UV}$. NLS1s are displayed as blue triangles and BLS1s as red squares.



6. Multi-variate Analysis 

Instead of only looking for linear relations between pairs of measured parameters in a sample,
multi-variate analysis examines the full data set in multiple dimensions. It looks into the
n-dimensional parameter space and searches for underlying properties/components that drive the
observed parameters. It also searches for groups/clusters in this parameter space and, by
interweaving the different methods, is able to find new relations and classes within the data set.

6.1 Statistical software R 

All of the multi-component statistical analysis methods shown here used the statistical package
R (e.g. ^]). R is an object-oriented language for statistical data analysis. It is an extremely
powerful package and, unlike commercial statistical software packages, R is free and is
continuously maintained and developed². R provides a huge library of statistical analysis packages
that can be used for astronomical purposes.

²http://cran.r-project.org

Figure 5: Correlation between the Eddington ratio $L/L_{\rm Edd}$ and the optical-to-X-ray
spectral slope $\alpha_{\rm ox}$. NLS1s are displayed as blue triangles and BLS1s as red squares.

6.2 Principal Component Analysis

The Principal Component Analysis (PCA) is a relatively old statistical tool, developed more than
100 years ago (Pearson 1901). The idea of the PCA is to reduce the number of parameters in a
sample to a small number of significant parameters that capture most of the variance in the data.
For example, the parameters that can be measured from stars can basically be described by their
mass and metallicity. In a mathematical sense, the PCA searches for the eigenvalues and
eigenvectors of a correlation coefficient matrix:

$$A \times x = \lambda \times x \qquad (6.1)$$

where $A$ is the correlation coefficient matrix, $\lambda$ an eigenvalue and $x$ an eigenvector. In
a geometrical sense, the PCA performs an axis transformation for the cloud of data points in this
n-dimensional space (n input parameters/properties). The first principal component is the
eigenvector whose direction is along the longest elongation of this cloud. The second eigenvector
is the vector perpendicular to this one in the direction of the second-largest elongation of the
data cloud, and so forth. A good description of the application of a PCA in astronomy can be found
in ^. Always keep in mind that a PCA is a purely mathematical method that can, but does not
necessarily, result in physically meaningful parameters.
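
To make Eq. 6.1 concrete, the sketch below builds a correlation coefficient matrix from toy data and extracts its two leading eigenpairs. It is not the R analysis used for the sample: it is a pure-Python illustration that swaps in power iteration with deflation for a full eigendecomposition, and the three-parameter toy data set and all function names are invented for the example.

```python
import math


def correlation_matrix(rows):
    """Return the Pearson correlation matrix of the columns of `rows`."""
    n, p = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(p)]
    stds = [math.sqrt(sum((r[j] - means[j]) ** 2 for r in rows) / (n - 1))
            for j in range(p)]
    # Standardize every parameter to zero mean and unit variance.
    z = [[(r[j] - means[j]) / stds[j] for j in range(p)] for r in rows]
    return [[sum(zr[a] * zr[b] for zr in z) / (n - 1) for b in range(p)]
            for a in range(p)]


def mat_vec(matrix, vec):
    """Multiply a square matrix by a vector."""
    return [sum(m * v for m, v in zip(row, vec)) for row in matrix]


def leading_eigenpair(matrix, iterations=1000):
    """Power iteration for the largest eigenvalue of a symmetric matrix."""
    vec = [float(j + 1) for j in range(len(matrix))]  # arbitrary start vector
    for _ in range(iterations):
        new = mat_vec(matrix, vec)
        norm = math.sqrt(sum(v * v for v in new))
        vec = [v / norm for v in new]
    value = sum(v * w for v, w in zip(vec, mat_vec(matrix, vec)))
    return value, vec


def deflate(matrix, value, vec):
    """Remove an eigenpair so that the next-largest one becomes dominant."""
    p = len(matrix)
    return [[matrix[a][b] - value * vec[a] * vec[b] for b in range(p)]
            for a in range(p)]


# Toy data (NOT the AGN sample): 6 objects, 3 measured parameters.
toy = [(1.0, 2.1, 5.0), (2.0, 3.9, 3.0), (3.0, 6.2, 6.0),
       (4.0, 7.8, 2.0), (5.0, 10.1, 4.0), (6.0, 12.0, 1.0)]

corr = correlation_matrix(toy)
val1, vec1 = leading_eigenpair(corr)
val2, vec2 = leading_eigenpair(deflate(corr, val1, vec1))

# The trace of a correlation matrix equals the number of parameters, so
# eigenvalue / p is the fraction of the variance carried by a component.
p = len(corr)
print(f"eigenvector 1: {val1 / p:.0%} of the variance")
print(f"eigenvectors 1+2: {(val1 + val2) / p:.0%} of the variance")
```

Working on the correlation matrix instead of the covariance matrix puts parameters with very different units, such as line widths and spectral slopes, on an equal footing.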


Figure 6: Strength of the eigenvectors from the Principal Component Analysis of our AGN sample. The 
columns represent each eigenvector starting with eigenvector 1 at the left. 

When applying the PCA to the AGN sample using $\alpha_{\rm X}$, $\alpha_{\rm UV}$,
$\alpha_{\rm ox}$, FWHM(H$\beta$), FWHM([OIII]), [OIII]/H$\beta$, FeII/H$\beta$, and
$L_{0.2-2.0\,{\rm keV}}$ as input parameters, we found that the first two eigenvectors already
account for 64% of the variance in the sample. Figure 6 displays the strengths of the
eigenvectors. A stronger eigenvector 1 results in steeper X-ray spectra, bluer optical spectra,
and steeper $\alpha_{\rm ox}$. We also found the well-known eigenvector 1 relation between [OIII]
and FeII of Boroson & Green [||]. These relations suggest that eigenvector 1 is associated with
$L/L_{\rm Edd}$. Figure 7 displays the relation between eigenvector 1 and $L/L_{\rm Edd}$, and
indeed it shows a very strong correlation between these two properties. We interpret eigenvector 2
as being associated with the mass of the central black hole.

6.3 Cluster Analysis 

Figure 7: Eigenvector 1 vs. $L/L_{\rm Edd}$. Again, NLS1s are displayed as blue triangles and
BLS1s as red squares.

The other multi-component technique is cluster analysis. Cluster analysis is a purely
statistical/mathematical tool that allows an unbiased search for groups in multi-dimensional data
sets. In its simplest form, cluster analysis searches for the nearest neighbor of each object in
the parameter space. It then groups these neighbors with other groups of neighbors, and so on.
This is also called hierarchical clustering (e.g. [^). At the end we get a so-called dendrogram,
such as the one displayed for our AGN sample in Figure 8. Clustering is a powerful tool for data
mining. It can find groups and correlations that are otherwise missed with simple analysis and
classification tools. A consistency check of the results of the cluster analysis can again be done
through the PCA: the groups found in the cluster analysis should also be found as groups in the
eigenvector 1 - eigenvector 2 diagram from a PCA.
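
The nearest-neighbor grouping described above can be sketched as agglomerative clustering. The example below assumes single linkage (the distance between two groups is the distance between their closest members) and Euclidean distances. The linkage and distance measure actually used for the AGN sample are not stated here, and the two-dimensional toy points and the function name are invented for illustration.

```python
import math


def single_linkage(points, n_clusters):
    """Merge the two closest groups repeatedly until n_clusters remain."""
    if not 1 <= n_clusters <= len(points):
        raise ValueError("n_clusters must be between 1 and the number of points")
    clusters = [[i] for i in range(len(points))]
    merges = []  # (group_a, group_b, distance) per step: the dendrogram as a list

    def gap(a, b):
        # Single linkage: distance between the closest pair of members.
        return min(math.dist(points[i], points[j]) for i in a for j in b)

    while len(clusters) > n_clusters:
        d, x, y = min((gap(clusters[x], clusters[y]), x, y)
                      for x in range(len(clusters))
                      for y in range(x + 1, len(clusters)))
        merges.append((clusters[x], clusters[y], d))
        merged = clusters[x] + clusters[y]
        clusters = [c for k, c in enumerate(clusters) if k not in (x, y)]
        clusters.append(merged)
    return clusters, merges


# Toy data (NOT the AGN sample): two well-separated groups of three points.
toy = [(0.0, 0.0), (0.1, 0.2), (0.2, 0.1), (5.0, 5.0), (5.1, 5.2), (4.9, 5.1)]
groups, merges = single_linkage(toy, n_clusters=2)
for members in groups:
    print(sorted(members))
```

In practice the input parameters would be standardized first, as in a PCA on the correlation matrix, so that no single parameter dominates the distances. The recorded merge distances are what a dendrogram displays as branch heights.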

Figure 8 displays the dendrogram for our AGN sample using the same input parameters as used for
the PCA. Clearly the AGN can be separated into two classes. By itself, cluster analysis is a
purely mathematical method and these groups may not mean anything. However, let us have a look at
the properties of the two groups. Group 1 shows steeper X-ray spectra, bluer optical/UV continua
and steeper optical-to-X-ray spectral slopes $\alpha_{\rm ox}$. It also contains mostly objects
with narrower H$\beta$ lines and higher Eddington ratios $L/L_{\rm Edd}$. The AGN in group 1 are
those with the strongest FeII and weakest [OIII] emission. Does this mean that group 1 contains
mostly NLS1s and group 2 BLS1s? Let us first look at the eigenvector 1 - eigenvector 2 diagram
found from the PCA, which is shown in Figure 9. The members of group 1 are shown in this plot as
blue triangles. These are the objects with high eigenvector 1 values and, as we have seen before
in the PCA, these are the objects with high $L/L_{\rm Edd}$. While 62% of the members of group 1
are NLS1s, only 20% of the members of group 2 are NLS1s, which is closer to the typical fraction
of NLS1s found among AGN.

Figure 8: Dendrogram of the hierarchical cluster analysis of the AGN sample. Clearly the AGN
sample can be divided into two groups.

Finally, we can ask whether the cutoff line between NLS1s and BLS1s at 2000 km s$^{-1}$ is still a
valid definition. The answer is yes and no. No, because the 2000 km s$^{-1}$ definition is purely
arbitrary and there are many sources with NLS1-like properties that have FWHM(H$\beta$) broader
than the 2000 km s$^{-1}$ cutoff. Yes, because a majority of NLS1s fall into our group 1. All in
all, NLS1 is still a valid class/definition, but we should be more flexible about the cutoff line.
Generally speaking, we should talk more about high-$L/L_{\rm Edd}$ AGN, because this definition
rests on a more statistical and physical basis.

7. Discussion and Conclusions 

Although NLS1 research is now more than 25 years old, it is still an active field in AGN research
which creates a lot of interest. It remains a productive research area which generates more than
50 refereed publications each year. To make it clear, NLS1s are not a historical accident! NLS1s
are extreme AGN, but they are needed for our interpretation of the AGN phenomenon.

Extending our statistical study into n-dimensional parameter space by using multi-variate tools
like the PCA and cluster analysis shows that AGN separate into high and low $L/L_{\rm Edd}$
objects. The high-$L/L_{\rm Edd}$ sample is mostly represented by NLS1s. Although the
2000 km s$^{-1}$ cutoff line between NLS1s and BLS1s is somewhat arbitrary, NLS1 still remains a
valid AGN class.
Figure 9: Eigenvector 1 vs. eigenvector 2 from the PCA using the two main groups found by the cluster
analysis. Group 1 is displayed as blue triangles and group 2 as red squares.